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Abstract 



We study compressive sensing in the spatial domain for target localization in terms of direction of 
;h ■ 
Oh' arrival (DOA), using multiple-input multiple-output (MIMO) radar A sparse localization framework is 



proposed for a MIMO array in which transmit and receive elements are placed at random. This allows to 
dramatically reduce the number of elements needed, while still attaining performance comparable to that 
of a filled (Nyquist) array. Leveraging properties of structured random matrices, we develop a bound on 
the coherence of the resulting measurement matrix, and obtain conditions under which the measurement 
'^ I matrix satisfies the so-called isotwpy property. The coherence and isotropy concepts are used to establish 

uniform and non-uniform recovery guarantees for target localization using spatial compressive sensing. 
In particular, non-uniform recovery is guaranteed if the product of the number of transmit and receive 

' 2 

OO ' elements MN, namely, the number of degrees of freedom, scales with K (log G) , where K is the 

!>■ 

Xf^ ' number of targets, and G is proportional to the array aperture and determines the angle resolution. The 

^' 

^' 

f^ i with a small number of MIMO radar elements. This is in contrast with a filled virtual MIMO array 

cn 



significance of the logarithmic dependence in G is that the proposed framework enables high resolution 
with a small number of MIMO radar elements 
where the product MN scales linearly with G. 



Index Terms 

Compressive sensing, MIMO radar, random arrays, direction of arrival estimation. 

I. Introduction 

Detection, localization, and tracking of targets are basic radar functions. Limited data support and low 
signal-to-noise ratios (SNR) are among the many challenges frequently faced by localization systems. 
Another challenge is the presence of nearby targets, in terms of location or Doppler, since closely spaced 
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targets are more difficult to discriminate. In multiple-input multiple-output (MIMO) radar, targets are 
probed with multiple, simultaneous waveforms. Relying on the orthogonality of the waveforms, returns 
from the targets are jointly processed by multiple receive antennas. MIMO radar is typically used in two 
antenna configurations, namely distributed [1] and colocated HI. Depending on the mode of operation and 
system architecture, MIMO radars have been shown to boost target detection, enhance spatial resolution, 
and improve interference suppression. MIMO radars achieve these advantages by capitalizing on a larger 
number of degrees of freedom available than in "conventional" radar. 

In this work, we focus on the application of MIMO radar to direction-of-arrival (DOA) estimation. It is 
well known in array signal processing |l3] that DOA resolution improves by increasing the array aperture. 
Increasing the array aperture without increasing the number of sensors may lead to ambiguities, i.e., a set 
of measurements can be explained by erroneous sets of locations. A non-ambiguous uniform linear array 
(ULA) must have its elements spaced at intervals no larger than A/2, where A is the signal wavelength. 
For a MIMO radar, unambiguous direction finding of targets is possible if N receive elements are spaced 
A/2, and M transmit elements are spaced NX/2, a configuration known as virtual ULA \2\. In sampling 
parlance, the A/2-spaced array and the MIMO virtual ULA perform spatial sampling at the Nyquist rate. 
The main disadvantage of this Nyquist setup is that the product of the number of transmit and receive 
elements, MN, i.e., the degrees of freedom of the system, needs to scale linearly with the array aperture, 
and thus with resolution. 

In this paper, we propose the use of a sparse, random array architecture in which a low number of 
transmit/receive elements are placed at random over a large aperture. This setup is an example of spatial 
compressive sensing since spatial sampling is applied at sub-Nyquist rates. The goal of spatial compressive 
sensing is to achieve similar resolution as a filled array, but with significantly fewer elements. 

Localizing targets from undersampled array data links random arrays to compressed sensing |[4|. 
Random array theory can be traced back to the 1960's. In [5 1, it is shown that, as the number of sensors is 
increased, the random array pattern, a well known quantity to radar practitioners, converges to its average. 
This is because the array pattern's variance decreases linearly with the number of elements. This work 
was extended to MIMO radar in 0. The main conclusion of the classical random array literature was 
that the random array pattern can be controlled by using a sufficient number of sensors. However, two 
fundamental questions were left pending: How many sensors are needed for localization as a function 
of the number of targets, and which method should be used for localization? Here we suggest that the 
theory and algorithms of compressed sensing can be used in order to address those questions. 

Early work on compressive sensing radar emphasized that the sparse nature of many radar problems 
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supports the reduction of temporal as well as spatial sampling (an overview is given in Q). Recent work 
on compressive sensing for single-input single-output radar lISl- llTTI demonstrates an increased resolution 
or a reduction in the temporal sampling rate. Compressive sensing for MIMO radar has been applied both 
on distributed [12] and colocated |[T3l setups. Much of the previous literature on compressive sensing for 
colocated arrays discusses the ULA setup, either within a passive system (with only receive elements) |fT5l| 
or in a MIMO radar |[T3l setup. In contrast, we are interested in spatial compressive sensing which relies 
on a random array geometry. Links between compressive sensing and random arrays have been explored 
in |fT6ll . showing that spatial compressive sensing can be applied to the passive DOA problem reducing the 
number of receiving elements. However, the MIMO radar framework poses a major challenge: contrary 
to the passive setup, where the rows of the sensing matrix A are independent, the MIMO radar MN 
measurements are not independent, as they conform to the structure of the MIMO random array steering 
vector This prevents the application of the vast majority of the results in the literature. A MIMO radar 
random array architecture is studied in [14], but no recovery guarantees are provided. 

Low rate spatial sampling translates into cost savings due to fewer antenna elements involved. It 
is of practical interest to determine the least amount of elements required to guarantee correct targets 
recovery. Finding conditions that guarantee correct recovery has been a main topic of research and one of 
the underpinnings of compressive sensing theory. Recent work has shown that, for a sufficient number of 
independent and identically distributed (i.i.d.) compressive sensing measurements, non-uniform recovery 
can be guaranteed if a specific property of the random sensing matrix A, called isotropy, holds ifTTl . 
While this property plays an important role, this result does not apply to our setup since the MIMO radar 
MN measurements are not independent. The non-independent rows problem was recently addressed in 
ifTSl . where the authors derived conditions for non-uniform recovery using spatial compressive sensing 
in a MIMO radar system with N transceivers. This architecture is a special case, in which the positions 
of transmitters and receivers of a MIMO radar coincide. 

This work expands the literature in several ways. Specifically, we propose a sparse localization frame- 
work for a MIMO random array for a general setup of M transmitters and N receivers. We provide a 
bound on the coherence of the measurement matrix, and determine under which conditions the isotropy 
property holds. This allows us to develop both uniform and non-uniform recovery guarantees for target 
localization in MIMO radar systems. Preliminary results on the work presented here were published in 
our conference publication ||T9l. The proposed MIMO random array framework is of practical interest 
to airborne and other radar applications, where the spacing between antenna elements may vary as a 
function of aspect angle towards the target, or where exact surveying of element locations is not practical 
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due to natural flexing of the structures involved. Our results show that one can obtain high-resolution 
with a reduced number of measurements. 

The paper is organized as follows: Section JI] introduces the system model, and the proposed sparse 
localization framework. In Section JIlJ we discuss spatial compressive sensing. Recovery guarantees for 
the proposed spatial compressive sensing framework are derived in Section |IVl In Section |Vl we present 
numerical results demonstrating the potential of the proposed framework, followed by conclusions in 
Section |Vll 

The following notation is used: boldface denotes matrices (uppercase) and vectors (lowercase); for a 
vector a, the i-th index is a^, while for a matrix A, the i-th row is denoted by A {i, :). The complex 
conjugate operator is (•)*, the transpose operator is (•) , and the complex conjugate-transpose operator 
is (•) . We define a K-sparse matrix to have only K non-zero norm rows, the number of non-zero norm 
rows of X is ||X||q and the support of X collects the indices of the non-zero norm rows. The operator 
E denotes expectation and we define t/^x (u) = E [exp {jxu)] as the characteristic function of the random 
variable x. The symbol "(g)" denotes the Kronecker product. The notation x ~ CM {fi, C) means that the 
vector X has a circular symmetric complex normal distribution with mean fi and covariance matrix C. 
We denote with Ka (•) the modified Bessel function of the second kind. 

II. System Model 
A. MIMO Radar Model 

We model a MIMO radar system (see Fig. III-AI ) in which A^ sensors collect a finite train of P pulses 
sent by M transmitters and returned from K stationary targets. We assume that transmitters and receivers 
each form a (possibly overlapping) linear array of total aperture Ztx and Zrx, respectively, where Ztx 
and Zrx are normalized in wavelength units. Defining Z = Ztx + Zrx, the m-th transmitter is at 
position Z^m/2 on the x-axis, while the n-th receiver is at position ZC„/2. Here ^m lies in the interval 
[ — ^, — §^], and C,n is in [ — |^, ^^]. This definition ensures that when Ztx = Zrx, both ^m and 
C,n are confined to the interval [— ^, ^], simplifying the notation in the sequel. 

Let Sm (t) denote the continuous-time baseband signal transmitted by the m-th transmit antenna and 
let 9 denote the location parameter(s) of a generic target, for example, its azimuth angle. Assume that the 
transmitted probing signals are narrowband and that the propagation is nondispersive. Then, the baseband 
signal at the target location, considering the p-th transmitted pulse, can be described by (see, e.g., ||T]) 

M 

5^ exp {j2TrfoTm (9)) Sm {t - pT) ^ J {6) s (t - pT) . (1) 

m=l 
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Fig. 1. MIMO radar system model. 



Here /o is the carrier frequency of the radar, r^ (^) is the time needed by the signal emitted by the 



m-th transmit antenna to arrive at the target, s it) = \s\ (t) 
interval, and 



, SM it)\ , T denotes the pulse repetition 



c (9) = [exp (j27r/ori {9)) , . . . , exp iJ27rfoTM (9))]' 



(2) 



is the transmit steering vector. Assuming that the transmit array is calibrated, c {6) is a known function 
of 6*. 

To develop an expression for the received signal r„ (t) at the n-th receive antenna, let 



b (9) = [exp {j27Tfon m , . . . , exp (i2^/of^ {9))Y 



(3) 



denote the receive steering vector where f„ (9) is the time needed for the signal reflected by the target 
located at 9 to arrive at the n-th receive antenna. Define the vector of received signals as r (i) = 
[ri (t) , . . . , rjv (t)] . Under the simplifying assumption of point targets, the received data vector is 
described by |jT] 

K P-l 

r{t) = J2Yl ^'^> (^'^) ^^ (^'^) s (t - pT) + e (t) (4) 

fc=i p=i 

where K is the number of targets that reflect the signals back to the radar receiver, x^ p is the complex 
amplitude proportional to the radar cross sections of the k-th target relative to pulse p-th, 9j^ are locations, 
P is the number of pulses over which the targets are observed, and e (t) denotes the interference plus- 
noise term. The targets' positions are assumed constant over the observation interval of P pulses. We 
assume that the target gains {x^ j,} follow a Swerling Case II model, meaning that they are fixed during 
the pulse repetition interval T, and vary independently from pulse to pulse EDI . 
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B. Problem Formulation 

The purpose of the system is to determine the DOA angles to targets in a common range bin. The 
assumption of a common range bin impUes that all waveforms are received with the same time delay after 
transmission. Since range measurements are not of interest, the common time delay is not explicitly shown 
in our model. Targets are assumed in the far-field, meaning that a target's DOA parameter 9 = sini9 
(where -d is the DOA angle) is constant across the array. Under these assumptions, the receiver and 
transmitter steering vectors, b {6) and c {6) respectively, become 



b {9) = [exp (jVZ^Ci) , . . . , exp (ivrZ^C 



N) 



and 



c [9) = [exp {j^Z9ii) , . . . , exp (JTrZ^a/)]' 



(5) 



(6) 



By cross-correlating the received signal at each sensor with filters matched to each of the probing 
waveforms, we obtain 



yp 



vec 



vec 



/ r (t) s^ (t - pT) dt 

+ / e (t) s^ {t - pT) dt 



(7) 



where the M x M matrix W has elements 



[Wl 



m,] 



Sm (t) S* (t) dt. 



(8) 



We assume the M probing waveforms to be orthogonal (e.g., pulses modulated by an orthogonal code), 
therefore W = I. Defining the AIN x P matrix Y = [yi, . . . , yp], we have from (jT]), 



Y = A (6*) X + E 
where X = [xi, . . . , xp] is a K x P matrix with Xp = [xi_p, . . . , XK,p] , 

Ai9) = [8.{9i),...,^{9K)] 

is a MN X K matrix with columns 

s,{9)^c{9)®h{9) 



(9) 



(10) 



(11) 



known as the "virtual array" steering vector, and E = [ei, . . . , ep] is MA^xP with ep = vec [J e (t) s^ (t — pT) dt\ . 
The term "virtual array" indicates that a {9) can be thought of as a steering vector with MN elements. 
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The aim is to recover and X from Y using a small number of antenna elements. In order to do 
this, we use a sparse localization framework. Neglecting the discretization error, it is assumed that the 
target possible locations 9 comply with a grid of G points ^i-^ (with G » K). Since each element 
of 9 parameterizes one column of A {9), it is possible to define an MN x G dictionary matrix A = 
[ai, . . . , a^], where a^ = a (i;^^). The steering vector a^ is the Kronecker product of the receive steering 
vector hg = h{(l)g) from (|5]l, and the transmit steering vector Cg = c{(pg), from ^: 



The received signal is then expressed as 



ag = Cg (» b„. (12) 



Y = AX + E (13) 



where the unknown G x P matrix X contains the target locations and gains. Zero rows of X correspond 
to grid points without a target. The system model (IT3T i is sparse in the sense that X has only K <^ G 
non-zero rows. 

Note that in the sparse localization framework, the matrix A is known, whereas in the array processing 
model ^, the matrix A [9) is unknown. Therefore, the goal translates to determining the non-zero 
rows indices, i.e., the support of X, from the knowledge of the measurements Y and matrix A. The 
measurement matrix A is governed by the choice of grid points <J)i;G, by the number M of transmitters 
and their positions, ^i:A/, and by the number N of receivers and their positions, Ci;iv- In the following 
we assume that the transmitter (receiver) elements' positions £^i-m {Ci-.n) are independent and identically 
distributed (i.i.d.) random variables governed by probability density function (pdf) p (^) {p {(,)). Our aim 
is to provide conditions on p(^), p{C), M, N and 4>i-g in order to obtain recovery guarantees using 
practical algorithms. 

III. Spatial Compressive Sensing Framework 

The aim of spatial compressive sensing is to recover the unknown X from the measurements Y (see 
(IT3] )) using a small number of antenna elements. 

A. Beamforming 

Consider the scenario in which the transmitters and receivers locations support the Nyquist array 
("virtual ULA") geometry. In this setting, the matrix A in ( |T3] ) has a Vandermonde structure, and the 
aperture scales linearly with the number of antenna elements, Z = {MN — 1) /2. If we choose the 

April 17, 2013 DRAFT 



(unifonn) grid of possible target locations (j)g to match the array resolution, that is G = 2Z + 1, then 
the matrix A becomes a Fourier matrix. Due to the unitary property of the Fourier matrix, the Gram 
matrix Q = A^A can be written as a scalar multiple of the identity matrix, Q = MN ■ I, from which 
it follows that X can be estimated as X = {1/MN) ■ A^Y. In array processing, this method is called 
beamforming. The support of the unknown X is recovered by looking for peak values of ||a^Y|| over 
the grid points. Beamforming is also applied to estimate the locations of targets not limited to a grid, by 
finding the peaks of ||a^ (0) Y|| , where a {0) is a steering vector dTTT l swept over the angles of interest. 
The shortcoming of the Nyquist array setup is that the number of elements MN must scale linearly with 
the array aperture Z and thus with the resolution, i.e., such sampling mode requires MN = G. 

The spatial compressive sensing paradigm implies that a sparse X can be recovered from a number of 
spatial measurements MN significantly lower than the Nyquist array ("virtual ULA"), i.e., MN <^ G. 
The idea is to design the sensing procedure so that the matrix Q is a scalar multiple of the identity 
matrix on average^ i.e., E [Q] = MN ■ I, and to control the variance of the non-diagonal elements by 
using a sufficient number of measurements MN. Intuitively, the more measurements MN we employ, 
the closer we get to a diagonal Q. Furthermore, because when MN < G, each realization of Q has 
non-zero off-diagonal terms, the beamforming metric ||a^Y|| is affected not only by the g-th row of X 
and by the noise, but also by any row of X that has non-zero norm. This entails that more sophisticated 
recovery algorithms, which take into account the mutual interference among non-zero rows of X, must 
be employed instead of beamforming. A brief overview of compressive sensing recovery methods is 
provided next. 

B. Compressive Sensing 

One way to classify compressive sensing models is according to the number of pulses P ("snapshots" 
in array problems): single measurement vector (SMV) when P = 1 and Y reduces to a single vector, 
multiple measurement vector (MMV) for P > 1, when Y is a matrix. The system model in (|T3] ) is an 
example of an MMV setting. For simplicity, in the following discussion, we consider an SMV scenario 
(i.e., P = 1, Y = y, X = X and E = e in ^). 

In principle, a sparse x (i.e., it has only K <^ G non-zero rows) can be recovered from the least 
number of elements MN by solving the non-convex combinatorial ^o-norm problem 

minllxlln s.t. Ily — AxIL < o" (14) 

'For instance, this is obtained when using a partial Fourier matrix. 
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1 1 9 

or one of its equivalent formulations: a cardinality-constrained formulation, mirix ||y — Ax||2 s.t. ||x||q < 

111 1 1 2 

7, or a Lagrangian formulation, mirix 2 ||y — -A-x||2 + v ||x||q. These three formulations are equivalent 
for a proper choice of the parameters 7, v and a, which depend on prior information, e.g. the noise level 
||e||2 or the sparsity K. Unfortunately, the solution to any of these formulations requires an exhaustive 
search among all combinations of non-zero norm indices of x, necessitating exponential complexity |4|. 
A variety of polynomial complexity algorithms have been proposed for obtaining an approximate 
solution to (IT4I) . One family of methods is Matching Pursuit (MP). In its simplest version, an empty 
provisional support is refined by adding one grid-point index at each iteration. Among the matching 
pursuit algorithms, the most notable are Orthogonal Matching Pursuit (OMP) |[2TI . Orthogonal Least 
Squares (OLS) ||22l, and CoSaMP 111, which address the SMV setting, the Rank Aware-Orthogonal 
Recursive Matching Pursuit (RA-ORMP) algorithm |24] and its generalization Multi-Branch Matching 
Pursuit (MBMP) f25l|, both capable of addressing the general MMV setting. Another family of methods 
is known as Basis Pursuit (BP). The BP strategy relaxes the /o-norm in (IT4] | with the Zi-norm [26|. The 
reformulation is known as complex LASSO, defined as 

inin||x||i s.t. Ily — AxIL < o". (15) 

Unlike (IT4l ). this problem is convex, and a global solution can be found in polynomial time. Being a 
relaxation, the solution obtained could be different from the original. Finding conditions that guarantee 
correct recovery with a specific method (e.g. LASSO) has been a main topic of research and one of the 
underpinnings of compressive sensing theory |@]. 

Two kinds of recovery guarantees are defined in compressive sensing: uniform and non-uniform. A 
uniform recovery guarantee (addressed below by Theorem means that for a fixed instantiation of the 
random measurement matrix A, all possible K-sparse signals can be recovered with high probability. In 
contrast, a non-uniform recovery result (addressed by Theorem |4]) captures the typical recovery behavior 
for a random measurement matrix A. Specifically, suppose we are given an arbitrary K-sparse vector x, 
and we then draw A at random (independent of x). Non-uniform recovery details under what conditions 
an algorithm will recover x with high probability. Note that, for a non-uniform guarantee, A is being asked 
to recover only a specific x, not any K-sparse vectors. Therefore, uniform recovery implies non- uniform 
recovery, but the converse is not true. 

Loosely speaking, a uniform recovery guarantee can be obtained if, with high probability, the matrix A 
has small coherence [4|. The coherence is defined as the maximum inner product between the normalized 
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columns of A, 

// = max-— i-r — . (16) 

Alternatively, uniform recovery can also be obtained if A satisfies the Restricted Isometry Property (RIP) 
Bll with high probability. Non-uniform recovery can be guaranteed if a specific property of the random 
measurement matrix A, called isotropy, holds ifTTl . The isotropy property states that the components of 
each row of A have unit variance and are uncorrected, i.e., 

E[A^(t,:)A(t,:)]=I (17) 

for every t. 

Both (IT6l ) and dTV] ) suggest that the matrix Q = A^A plays a key role in establishing recovery 
guarantees. Indeed, because in our setting the rows of A are identically distributed, a simple calculation 
shows that E [A-^^ (t, :) A (t, :)] = jg^E[Q], thus the isotropy property requires E [Q] = MN ■ I. 
Furthermore, as evident by the definition of coherence in ([T6] ). ^ is recognized as the normalized maximum 
off-diagonal value of |Q|. In Section |IVl by deriving statistics of the matrix Q, we characterize uniform 
and non-uniform recovery guarantees for spatial compressive sensing. 

IV. Recovery Guarantees 

In this section, we develop recovery guarantees for sparse localization with MIMO random arrays. In 
detail, we show how to choose the grid-points 4>i;G, the number of elements MN and the distributions 
governing the element positions p (^) and p(C)> in order to guarantee target localization by spatial 
compressive sensing via (ITST l. Due to the role of Q in recovery guarantees, we start by studying the 
statistics of Q. 

A. Statistics of Q = A^ A 

To study the statistics of Q, we first analyze its relationship to a quantity known to radar practitioners 
as the array pattern ||27l . In array processing, the array pattern /3 (uj,/) is the system response of an array 
beamformed in direction 0/ to a unit amplitude target located in direction 4>i. In other words, /3 (uj,/) is 
the inner product between two normalized columns of the measurement matrix: 

/3K0= II f^' II (18) 

M N 

= MiV Zl Zl ^^P t^'^^'' (^" + ^-)] 

m=l 71=1 
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where we defined 

Ui^i = irZ {4)1 - cpi) . (19) 

The peak of the absolute value of the array pattern for a target colinear with the beamforming direction, 
1/3 (0)|, is called the mainlobe. Peaks of |/3 (m)| for n 7^ 0, are known as sidelobes, and the highest among 
all the sidelobes is called the peak sidelobe. Thus the terms a^^a^ in (|T8] ) play the role of sidelobes. 

The relation between coherence, isotropy and array pattern is apparent. Indeed, from (IT6] l. (ITST i and 
the definition of sidelobes, in array processing parlance, the coherence is the peak sidelobe associated 
with the matrix A. Similarly, from dTTT l and (ITST l. the isotropy can be related to the mean array pattern 

r?(ni,0=E[/3K0], (20) 

where the expectation E [/3 (uj,/)] is taken with respect to the ensemble of element locations. In particular, 
isotropy requires that f] (ui^i) = for any i y^ I. 

For a system with randomly placed sensors, the array pattern /3 (ui^i) is a stochastic process. Naturally, 
statistics of the array pattern of a random array depend on the pdf of the sensor locations. Analysis 
of the MIMO radar array pattern /? (uij) can be found in 16], where the means and the variances of 
the real and imaginary parts of (i {ui^i) were discussed. The following proposition formalizes pertinent 
results from ||6]. For the sake of brevity, we drop the dependency on i and /, and denote the array pattern 
/3 (u). Define z = C, + £^, and assume that the pdf of z, p (z), is an even function, so that Imr/ (u) = 0. 
Further define the variances of the array pattern af (u) = var [Re/3 (n)], a2 (u) = var [Im/3 (u)] and 
ai2 (u) = E [(Re/3 (-u) - r/ (n)) Im/3 (u)]. 

Proposition 1. Let the locations ^ of the transmit elements be Ltd., drawn from a distribution p(^), and 
the locations Q of the receive elements be i.i.d., drawn from a distribution p [Q). Then, for a given u, the 
following holds: 

1) The mean array pattern is the characteristic function of z, i.e., 

ri{u)=^,{u). ill) 

2) If £, and C, are identically distributed, then d22D . d2JP and (T12 {u) = hold. 

Proof: See Appendix A. ■ 

Proposition [T] links the probability distributions p {$) and p {(,) (via ijjz (u) and ip (n)) to the mean and 
variances of each element of the matrix Q, i.e., /3 (ui^) = ■jjj^afai. As shown below, this result is used 
to obtain non-uniform recovery guarantees. 
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2. ^ 1 



' (") = ^MN ^^ + "^^ ^^"^^ + "^^ ^"^ 



N+M-2 N+M-1 

(1 + V« (2m)) - ^Z-z (tt) 



2M7V ^ ■ -r-^^ " -- V ' ^jv 



(22) 



'"' ('') = 2ii^ (^ " ^^ ^2")) + ^^ ('') ^^MN ^ (^ + '^^ (2"^) ^^^^ 



To characterize the statistics of the coherence fi, from (IT6] ). we need the distribution of the maximum 
over all non-diagonal elements of |Q|. 

We now show that, by imposing specific constraints on the grid points (^i-g and on the probability 
distributions p{C) and p{C), we can characterize the distributions of the elements of Q. To do this, we 
require an intermediate result about the structure of the matrix Q when (pi^c is a uniform grid: 

Lemma 1. If (pi-c is a uniform grid, then Q has a Toeplitz structure. 

Proof: See Appendix B. ■ 

Thanks to Lemma [U whenever (I)i-g is a uniform grid, Q is described completely by the elements of 
the first row of A, a^^a^ for i = 1, . . . , G. From the definition of A, its columns all have squared-norm 
equal to MN. Therefore the elements on the main diagonal of Q are equal to MN. Thus, we need to 
investigate the remaining random elements, a{^aj for z = 2, . . . , G. By exploiting the Kronecker structure 
of the columns of A in (fT2l) . we can express the elements of Q, as: 

afa, ^(cf »bf)(c,»b,) 

= cf c,bf b,- (24) 

where b and c are the steering vectors of the receiver and transmitter arrays, respectively. 

From (l24l ). the random variable j3 (ni j) = jjj^sJ^aLi, is the product between the random variables 
Pc (^i.i) — ;^b(^bj and /3^ (ui,i) — ;^c{^Cj. As such, the distribution of (3 {ui^i) (or equivalently that of 
af^aj) can be characterized from the distributions of /3^ (mi,j) and (3^ {ui,i)- Following the approach in fS], 
in Appendix C we show that the real and imaginary parts of /3^ (ifi,j) have an asymptotic joint Gaussian 
distribution, and the same holds for /3^ {ui^i). In general, the variances of real and imaginary parts of 
I^C (^i,i) (or Pi i'^i.i)) are not equal. Interestingly, a closed form expression for the cumulative density 
function (cdf) of the product of /3^ (ui,i) and /3^ (ui,j) (i.e., the cdf of j^^^af^aj) exists in the special case 
when var [Re/3^ {ui,i)\ = var [Im/3^ (""i.i)] and var [Re/3^ (""i.i)] = var [Im/3^ (""i,?)]- In the following 
theorem, by meeting these conditions, we derive an upper bound on the sidelobes' complementary cdf 
(ccdf), i.e., Pr (;jgjy |^i^^i| > q)' and show that sidelobes have uniformly distributed phase. 
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We address two MIMO radar setups: (1) M transmitters and N receivers, where ^ and C, are indepen- 
dent, and (2) A^ transceivers, where ^n = Cn for all n and M = N. 

Theorem 1. Let the locations ^ of the transmit elements be drawn i.i.d. from a distribution p{S,), and 
the locations (^ of the receive elements be drawn i.i.d. from a distribution p {Q. Assume that p {(,), p (C) 
and the uniform grid (pi-.c are such that the transmitter and receiver characteristic functions satisfy 

ip^ (ni,i) = Vg (2ni,i) = ipc_ (ni,i) = Vc (2ni,i) = (25) 

for i = 2 to G, where ui^i = txZ {(pi — (pi). Then for i = 2, . . . ,G: 
I) If S, and C are independent: 

Pr (^ |af a, \ > q) < x ■ Ki{x) , (26) 



where x = 2VMNq. 
2)If(,n = Cn for all n: 



Pr ( ^ |af a^l > g ) < exp {-Nq) . (27) 



3) In both scenarios, the phase of sl^sli is uniformly distributed on the unit circle, i.e., 

Zaf aj ~ U [0, 27r] . (28) 

Proof: See Appendix C. ■ 

This theorem characterizes the distribution of ^g^a^^aj for both the M transmitters and N receivers 

setup and the N transceivers setup. In subsection IV.D, we provide a practical setup that satisfies ( [25l ). As 

shown in the next subsection, this allows to obtain a uniform recovery guarantee for spatial compressive 

sensing. 

B. Uniform recovery 

Theorem [1] enables to characterize the statistics of the coherence of A, defined in ( fT6l ). The following 
corollary of Theorem [T] bounds the probability that the matrix A has high coherence, or, equivalently, 
the probability of a peak sidelobe: 

Corollary 1. Let the locations of the transmit elements S, be drawn i.i.d. from a distribution p{^), and 
the locations of the receivers (^ be drawn i.i.d. from a distribution p{C)- Assume that the distributions 
p (S,) and p (C) and the uniformly spaced grid-points (pi-.c <^f^ such that f |25D holds for i = 2 to G. Then: 
1) If i and C, are independent: 

Pr (^ > g) < 1 - [1 - X • i^i (x)]^"^ , (29) 

April 17, 2013 DRAFT 



14 



where x = 2^/MNq. 
2) If ^n = Cn for all n: 



Pr(/i>g) < 1- [l-exp(-iVg)]^"^ (30) 



Proof: See Appendix D. ■ 

Similarly to the discussion after Theorem [T] since jjl can be interpreted as the peak sidelobe of the 
array pattern, (|29] l and (|30^ characterize the probability of having the peak sidelobe higher than q, in 
a system with M transmitters and N receivers, or N transceivers, respectively. These results are not 
asymptotic (i.e., they do not need the number of measurements M and N to tend to infinity), on the 
contrary, as shown in numerical results, it is already tight for M = N = lb elements, and for a small 
number of grid-points G. 

The coherence // plays a major role in obtaining uniform recovery guarantees for compressive sensing 
algorithms, as well as guaranteeing the uniqueness of the sparsest solution to (IT4l ). For instance, using the 
coherence /x, it is possible to obtain a bound on the RIP constant, 5k < {K — 1) // |[28l . which ensures 
stable and robust recovery by Zi -minimization (i.e., using (fTSl) ) from noisy measurements. In particular, 
by building on Corollary [T] the following theorem establishes the number of elements MN needed to 
obtain uniform recovery with high probability using (ITS) : 

Theorem 2 (Uniform recovery guarantee). Let the locations ^ of the transmit elements be drawn i.i.d. 
from a distribution p{(,), and the locations ^ of the receivers be drawn i.i.d. from a distribution p{C)- 
Let the distributions p (^) and p (C), and the uniform grid fpi-.G be such that relations ( 1251) hold for i = 2 
to G. Further, let 



MN >G[K-- 



log7+ 2l°g(21og7) 



2 



(31) 



where 7 = ^pT^Gj (2e), and the constant G = (43 + Vl\fl^ /16 ~ 4.6718. Then, with probability at least 
1 — e, for any K -sparse signal x € C^ measured from MN MIMO radar measurements y = Ax + e, 
with ||e||2 < CT, the solution x of di5D satisfies 



||x — XII2 < CO" (32) 

where c is a constant that depends only on e. 

Proof: See Appendix E. ■ 

The significance of (OTT ) is to indicate the number of elements necessary to control the peak sidelobe, 

and this is used to obtain a uniform recovery guarantee for spatial compressive sensing. In addition, the 
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previous theorem ensures exact recovery of any /f -sparse signal using dTSl ) in the noise-free case a = 0. 

It is important to point out that the number of grid points G is not a free variable, because the grid 
points 4>i-G must satisfy (|25] ). This point will be explored below, where we show that the resolution G 
must be linearly proportional to the "virtual" array aperture Z. 

Uniform recovery guarantees capture a worst case recovery scenario. In other words, the average 
performance is usually much better then that predicted by uniform recovery guarantees. In the following 
section, we show that if we consider a non-uniform recovery guarantee, then the zero mean conditions 
(I25] ) can be relaxed, and we obtain recovery guarantees that scale linearly with K. 

C. Non-uniform recovery 

In this subsection, we investigate non-uniform recovery guarantees. In recent work IITtI . it has been 
shown that, for a sufficient number of i.i.d. compressive sensing measurements, non-uniform recovery 
can be guaranteed if isotropy holds. However, the result in ifTTl cannot be directly used in our framework 
since the MN rows of the matrix A, following ([TT]) . are not i.i.d. The scenario when the measurements 
(rows of the matrix A) are not i.i.d. is addressed in [18], where non-uniform recovery is guaranteed for 
a MIMO radar system with N transceivers if the isotropy property (under the name aperture condition) 
holds. The following theorem derives conditions on grid points (pi-c and probability distributions p {£) 
and p {Q, for the random matrix A to satisfy the isotropy property: 

Theorem 3. Let the locations ^ of the transmit elements be drawn i.i.d. from a distribution p{S,), and 
the locations (^ of the receivers be drawn i.i.d. from a distribution p{C)- ^or every t, the t-th row of A 
in USj satisfies the isotropy property / |i7l/ , i.e., 

E[A^(t,:)A(t,:)] =1, (33) 

iff p (^), p (C) and (pi-c are chosen such that 

i^z{ui,i) = yi^l (34) 

where z = Q + ^ and ui,i = -kZ (cpi — (pi). 

Proof: See Appendix F. ■ 

Theorem |3] links the probability distributions p(^) and p{C) (through the characteristic function of z) 

and the grid-points (pi-c with the isotropy property of the matrix A. When ( [34b holds, it can be shown 
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that the aperture condition used in [18] holds too. Therefore, using the same approach as in lITSl . non- 
uniform recovery of K targets via (ITST i can be guaranteed in the proposed spatial compressive sensing 
framework. The following Theorem customizes Theorem 2.1 in ifTSl to our framework: 

Theorem 4 (Non-uniform recovery guarantee). Consider a K-sparse x € C*^ measured from MN MIMO 
radar measurements y = Ax + e where ||e||2 < o", let e > Q be an arbitrary scalar, and suppose that 
the random matrix A satisfies the isotropy property, E [A^ (t, :) A (t, :)] = I Vi. Then with probability 
at least 1 — e, the solution x to ( |75l ) obeys 



K 

MN 



|X-X||2 < ClCr^/^^-;^ (35) 



provided that the number of rows of A meets 

MN>CK\og^(—\ (36) 

where Ci, C and c are constants. 

Proof: The theorem results from Theorem 2.1 in lUSl by performing the following substitutions: K 
for s (sparsity), MN for n? (number of rows of A), and G for N (number of columns of A). Since in 
this work we consider ii'-sparse signals, in (l35T l. we discarded the term that accounts for nearly-sparse 
signals present in ifTSl . ■ 

Theorem |4] shows that, when the isotropy property is satisfied, the proposed framework enables to 
localize K targets using about MN = K (log G) MIMO radar noisy measurements. 



Some comments are in order First, it is important to stress that in (1361 ) the number of sensors scales 
linearly with the sparsity K as opposed to uniform recovery bounds based on coherence (e.g. (OTI l) which 
scale quadratically with K. Moreover, the significance of the logarithmic dependence in G is that the 
proposed framework enables high resolution with a small number of MIMO radar elements. This is in 
contrast with a filled virtual MIMO array where the product MN scales linearly with G. Again, it is 
crucial to point out that the number of grid points G is not a free variable, because the grid points 0i:G 
must satisfy (l34l ). Second, differently from (l35T l. in (|32] | the error did not depend on K, M and N. Third, 
(|35] ) shows that reconstruction is stable even when the measurements are noisy. Additionally, it can be 
noticed from (|35] ) that when cr = 0, Theorem |4] guarantees exact reconstruction with high probability, 
when (l36l ) holds. Furthermore, we want to point out that both results above could be extended to consider 
approximately sparse vectors, in which case an extra term appears in the right hand-side of both (|32] i 
and (|35] ). This situation may emerge when targets are not exactly on a grid, however, the analysis of the 
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off-grid target scenario is outside the scope of this paper. Finally, we would like to provide some intuition 
to the recovery guarantees. To this end, notice that both (|22]) and (1231 ) scale with 1/ {2MN). As such, 
recovery can be guaranteed by requiring the matrix A to satisfy the isotropy property, E [Q] = MN • I, 
and by controlling the variances of the non-diagonal elements of Q through the use of a sufficient number 
of measurements MN. 

D. Element locations and grid-points 

We now provide an example of choices of p{^), p (C) and (pi-c that meet the requirements of Theorem 
[T] and Theorem |3] 

The conditions needed by each theorem constraint the characteristic function of the random variables 
^, C- Let, Ztx = Zjix = Z/2, such that the random variables ^ and C are both confined to the interval 
[— ^, ^]. The characteristic function of a uniform random variable C ~ ^ [~^i |] is the sine function, 

i.e., 

, , sin (u/2) 

Therefore, when C, is uniformly distributed, by choosing (pi-c as a uniform grid of 2/Z-spaced points in 
the range [—1, 1], we have that V'c (""i,;) = V'c {'^^i,i) = for any i 7^ / (since Uij = ttZ {(j)i — (pi) = 
2iT \i — l\). The number of grid points G is not a free variable, because the grid points ipi-c must satisfy 
(l25l) or (l34l ). For instance, in the example above, (pi-c must be a uniform grid of 2/Z-spaced points 
between [—1, 1], and, assuming that Z is an integer, the number of grid points is G = Z + I. 

The dependence between the number of grid points G and the virtual array aperture Z can be understood 
by noticing that both ( |25] ) and ( [34l ) impose that grid points are placed at the zeros of the characteristic 
function of the relative random variable (i.e. the sine function). The spacing of the zeros is dictated by 
the virtual array aperture Z. The bigger the aperture the more grid points fit in the range [—1, 1]. 

Summarizing, choose (pi-c as a uniform grid of 2/Z-spaced points in the range [—1, 1]. Then: 

1) If both C and ^ are uniformly distributed, relations (|25i hold, and we can invoke Theorem [1] (for 
uniform recovery); 

2) If either ^ or ,^ are uniformly distributed, relation (|34l holds, and we can invoke Theorem |3] (for 
non-uniform recovery). 

Note that non-uniform recovery, i.e., (|34] i. requires only one density function, say p{C), to be uniform, 
while the other distribution, p (^), can be arbitrarily chosen, e.g., it can be even determrnistically dependent 
on Q. For instance, ( [34l ) is satisfied in a MIMO radar system with N transceivers, i.e., when Qi-n are 
i.i.d. uniform distributed and we deterministically set ^„ = C„. 
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Fig. 2. Empirical ccdf of the coherence of the measurment matrix A and its upper bound as a function of the number of 
elements, (a) considers the AI transmitters and TV receivers setup and the upper bound is given in ( 129) : (b) considers the A^ 
tranceivers setup and the upper bound is given in ( 130) . 



V. Numerical Results 

In this section, we present numerical results to illustrate the proposed spatial compressive sensing 
framework. 

We design an example to follow Theorem \T\ in which p (^) and p {Q are both uniform distributions, 
and (l)i:G represents a uniform grid of 2/Z-spaced points in the interval [—1,1], which implies that the 
number of grid points is G = Z + 1. The system transmits a total of P pulses. Each pulse consists of M 
orthogonal spread spectrum waveforms of length M chips. The waveforms (expressed in discrete form) 
were chosen as the rows of the M x M Fourier matrix. Equal length apertures were assumed for the 
transmit and receive arrays, i.e., Ztx = Zrx = Z/2. The target gains were given by Xk = exp {—jipk), 
with ifj^ drawn i.i.d., uniform over [0, 2it), for all k = 1, ..., K (where K is the number of targets). The 
noise (see (IT3] )) was assumed to be distributed as vec (E) ~ CM (O, a^l) and the SNR is defined as 
— lOlogxo o"^- From the definition of the measurement matrix A, its columns all have squared-norm equal 
to MN. Throughout the numerical results, we normalize the columns of A to have unit norm. 

We first investigate the statistics of the matrix Q discussed in Section |llll In particular, we analyze 
the coherence fi of the measurement matrix A compared to the result given in Theorem [1] The virtual 
aperture was Z = 250 (thus G = 251). In Fig. |Vl we plot the ccdf of the coherence jj,, i.e. Pr {fi > q), 
as a function of the number of elements for (a) the M transmitters and N receivers setup and (b) the 
N transceivers setup. As a reference, we also plot the upper bound given in ( |29l ) and ( [30l ). respectively. 
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It can be seen how the upper bound becomes tighter and tighter as the number of elements increases. 
In addition, it is interesting to notice that the coherence of the matrix A for the A^' transceivers setup 
is very close to the coherence of the matrix A for the the M transmitters and N receivers setup when 
M = N = N'/2. 

We next present localization performance using practical algorithms. We implemented target local- 
ization using LASSO following the algorithm proposed in lITSi to solve problem dTSl ). In addition, we 
implement Beamforming, OLS, OMP, CoSaMP, FOCUSS f295 and MBMP Concerning MBMP, it requires 
as input a K length branch vector d, which define the algorithm's complexity (see ||25l for details on 
setting parameters for MBMP). The output of MBMP is the estimated support. We define the support 
recovery error when the estimated support does not coincide with the true one. For algorithms that return 
an estimate x of the sparse vector x (e.g., LASSO and FOCUSS), the support was then identified as the 
K largest modulo entries of the signal x. 

Analyzing how to set the noise parameter a in (IT4b . or the sparsity K, without prior information is the 
topic of current work [30], but outside the scope of this paper. Therefore in the following, we assume 
that the noise level is available and that the number of targets K is known (notice that this information 
is needed by all the algorithms including MBMP). The virtual aperture was Z = 250 (thus G = 251), 
and tests were carried out ior K = 5 targets. The SNR was 20 dB throughout. 

We first consider the non-uniform guarantee setting. Monte Carlo simulations were carried out using 
independent realizations of target gains, targets locations, noise and element positions. Fig. |V] illustrates 
the probability of support recovery error as a function of the number of measurements AIN. It is apparent 
that beamforming is not suited for the sparse recovery framework. Moreover, two main groups appear: on 
one side, OLS and OMP, which both have practically the same performance, on the other side, Ll-norm 
opt., CoSaMP, FOCUSS and MBMP. Among the latter group, it is important to point out that, although 
the recovery guarantee established in Theorem 5] requires the solution of dTSl ). and thus using LASSO, 
MBMP provides a viable and competitive way to perform target localization. 

We next consider the uniform guarantee setting. In this setup, we first generate a realization of the 
matrix A by drawing at random the element positions. Maintaining the matrix A fixed, we perform 500 
Monte Carlo simulations using independent realizations of target gains, targets locations and noise. For 
each considered recovery method, we defined a support recovery error if an error occur in any of the 500 
simulations. We then average throughout element positions realizations. Fig. |V] illustrates the probability 
of support recovery error as a function of the number of measurements MN. The difference among 
OLS/OMP and the more sophisticated methods (i.e., LASSO, CoSaMP, FOCUSS and MBMP) is even 
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Fig. 3. Probability of support recovery error as a function of the number of rows MN of A. Non-uniform setup. The system 
settings are Z = 250, G = 251 and iC = 5 targets with l^fcl = 1 for all k. The SNR is 20db. 



1 
Beamforming 




10" 



Fig. 4. Probability of support recovery eiTor as a function of the number of rows MN of A. Uniform setup. The system 
settings are Z = 250, G = 251 and A' = 5 targets with jxfc| = 1 for all k. The SNR is 20db. 



more evident in this setup (e.g. at MN = 81, the probability of OLS/OMP error is greater then 0.1), 
confirming the theoretical finding ||3T] of OLS/OMP unfitness to deliver uniform recovery. On the other 
hand, MBMP, an extension of OLS, still provides competitive performance. In particular, MBMP with 
d = [3, 3, 3, 3, 1] outperform the other methods. 
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VI. Conclusions 

We propose a sparse framework to address the source localization problem for a random array MIMO 
radar system. We link system design quantities, i.e., the probability distributions p{^) and p{C) of the 
tx/rx sensors location and the sparse localization grid points (pi-c, with the statistics of the Gram matrix 
Q and the related coherence of the matrix A. Based on this result, we were able to develop uniform 
and non-uniform recovery guarantees for spatial compressive sensing. We show that within the proposed 
framework, it is possible to localize K targets using about AIN = K (log G) MIMO radar noisy 
measurements, where G is proportional to the array aperture and determines the angle resolution. This 
is in contrast with a filled virtual MIMO array for which the product MN scales linearly with G. In 
other words, the proposed framework enables high resolution with a reduced number of MIMO radar 
elements. Moreover, since the results characterize the product of the number of transmit and receive 
elements, MIMO random array implementation further reduces the total number of antenna elements 
needed. 

VII. Appendix 
A. Proof of Proposition [7] 

1) Mean: The mean rj {u) is by definition the expectation of the random array pattern, i.e., E [/3 (n)], 
over Zmn = Cm + Cn- The expectation and the summations can be interchanged obtaining 

M N 

r] (u) = J^ X] Z] ^ [^^P U^^rnn)] ■ (38) 

m=l n=l 

Moreover, the average of exp [jui^i {(,„. + Cm)] does not depend on the index n and m, since C,i-]\j are 
identically distributed, and so are ^i-m- By dropping the indexes of Cn and ^m and using z = ^ + (,, we 
have the sum of MN identical terms, divided by MN. Thus rj (u) equals E [exp (juz)], the characteristic 
function of the random variable z. 

2) Variance: Let af (u) = Re/3(n) and (t| (u) = Im/3(u). For brevity of notation, we drop the 
dependency on u. First notice that since p (z) is even, its characteristic function is real, thus so is the 
mean value of the array pattern -q. We also have that E [(Re /3 — r/) Im /?] = E [Re /3 Im f3] — r]E, [Im /3] = 0, 
since the real and imaginary parts are uncorrelated and because E [Im /3] = 0. Next, we need to evaluate 



a? 4 E 



\2 



\2 



(Re (3 — 1]) and cr| = E (Im /3) . In order to derive these quantities, we consider the 



expectations given by E 



(/3 - r]) and E [|/3 - r]\'^] . It can be shown that. 



E 



iP-riY =at-ai+j2ai2 (39) 
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and 



IE[|/3-r?| 



fJi + (72 . 



(40) 



Substituting the definition of the random array pattern /3 (n) (ITSl l and (|2TI ) in ( [39l) and (|40l ). we obtain 
(Ell) and (|23l). 



B. Proof of Lemma [7] 

From dTSl ) we have that af^a; = MA^ • /3 (««,;), where Ujj = vrZ (0j — (/>;). When 0i:G' is a uniform 
grid, (/)j — (f)i is constant whenever i — / is constant, i.e., along every diagonal of the matrix Q. Since 
/? (uj i) depends only on the term (pi — (pi (not on the actual (pi and (pj), it follows that Q is a Toeplitz 
matrix. 

C. Proof of Theorem [7] 



We define the array pattern associated with the transmitter as 



A 1 ^ 1 



and with the receiver arrays as: 



n=l 



M 



m=l 



(41) 



(42) 



Statistical properties of random arrays were analyzed in [5 | in the case of passive localization (i.e., an 
array with only receiving elements). The following lemma customizes useful results from |)5l: 

Lemma 2. Let the locations C,i-n of the receiving array be i.i.d., drawn from an even distribution p (Q and 
consider a given u. Then j3q (u) is asymptotically jointly Gaussian distributed (we neglect the dependency 
on u): 






'M 



Reip( 



0-? 



a^ 



(43) 



where a\ (n) = ^^ [1 + ^c (2n)] - ^V^ (^) «"^ ^2 (^) = 2^ [1 - ^C (2^^)]- 



Proof: See El. ■ 

The joint distribution of Re /3^ (u) and Im /3g (u) can be obtained similarly. 

For a given i G {2, . . . , G}, using Lemma [T] and the assumption that the mean patterns of both the 
transmitter and receiver arrays satisfy (E5l . i.e., f/-"^ (^i.i) = "0^ (2ui,j) = V'c (^i,«) = V'c (2wi,j) = 0, 
we have that, for both transmitter and receiver arrays, the array pattern evaluated at any grid point is 
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being drawn from an asymptotically complex normal distribution with variance defined by the number 
of transmit and receive elements, i.e., P^ (ui^i) ~ CAf(0,jj) and /3^ (ni,j) ~ CJ\f [O, jj^). It follows 
that the random variable q = jj \h^hi\ can be approximated as belonging to Rayleigh distribution, i.e., 
p{q) = ('Z/o'^) exp (— g^/2cr^), where a"^ = 1/ {2N), and similarly the random variable -^ |c{^Cj| is 
governed by a Rayleigh distribution with variance a^ = 1/ (2Af). 

If ^ and C are independent (part 1), the two random variables -^ |bf^bj| and -^ |c{^Cj| are independent. 
Using (l24l ). we have that the distribution of -^^^^ |£i?aj| is the product of two independent Rayleigh 
distributed variables. The cumulative density function of such a variable is given in ll32l . It follows that 
the ccdf of jjj^ 1^1^ ^2 1 satisfies 

1 ' i/. 



Pr(^^^|afa,|>gJ <x-i^i(x), (44) 

where x = 2y/MNq. 

If ^„ = C,n for all n (part 2), by using (|24] |. we have that -^ |a{^aj| = (-^ |bf^bj|) . Since the random 
variable -^ |bf^bj| has a Rayleigh distribution, -^ l^^i^^il is distributed as the square of a Rayleigh 
distribution, which has cdf 1 — exp {—Nq). As such, its ccdf satisfies 

Pr f ^ |af ail > q] < exp (-Nq) . (45) 

Part 3 follows because, from (l24l ). the phase of a^^aj is the sum of the phases of bf^bj and c{^Cj. In 
the case of transceivers the phase of a(^aj is evidently uniform since it is the same phase of b(^bj. 
In the case of M transmitter and N receivers, since both hfhi and c^Ci are two independent circular 
symmetric complex normal variables, the sum of the phases is itself uniformly distributed over [0,2tt), 
concluding the proof. 

D. Proof of Corollary \J} 

We take the conservative assumption of independence between the G—1 random variables | -^g^af^aj], 
for i = 2 to G. If ^ and ( are independent (part 1), from (|26] |. the ccdf of the maximum among G — 1 
such variables (which gives the coherence), is upper bounded by 

1 



Pr I max 



MN ^ 



G-l 



>q] <l-[l-x-Ki (x)]^-^ , (46) 



where x = 2VMNq. 

If Cn = Cn for all n (part 2), by using (|27| i. the ccdf of the maximum among G—1 such variables, is 
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upper bounded by 

Pr I max 

This concludes the proof. 






> g 1 < 1 - [1 - exp {Nq)f-^ . (47) 



E. Proof of Theorem |2] 

The theorem follows by combining the claims of Theorem 2.7 in ll28l and Corollary [T] In particular, 
Theorem 2.7 in ||28l provides stable recovery guarantees for all -fC-sparse signal if the measurement 
matrix A has RIP 52K <2/ h,+ ^/fjt) = a. 

The goal is therefore to bound the RIP 52k of the spatial compressive sensing measurement matrix A 
with probability higher then 1 — e. In other words, we want to find how many measurements MN we need 
to satisfy Pr {62K < a) > 1-e. By using 62K < (2K - 1) /U |28], we have that 62K <aif {2K - I) fi < 
a. Moreover, the condition Pr {{2K — 1) /x < a) > 1 — e is equivalent to Pr (/x > a/ {2K — 1)) < e. 
Therefore by invoking (|29l ) in Corollary [T] we can write 

Pr {52K >a)<l-[l-x-Ki {x)f-^ (48) 

where, by combining x = 2\/MNq and q = a/ {2K — 1), we have x = 2\^MNa/ {2K — 1). Our goal 
is to find the value of MN that makes the right hand-side equal to e. 

We first approximate the modified Bessel function of the second kind Ki (x) for a large absolute value 
and small phase of the argument (in our setting, the argument x is real) ll33l : Ki (x) ^ sj^exp {—x). We 
thus would like to enforce 1 — [l — ^/^exp (—3;)] = e. Defining t = y^^exp (— x) and linearizing 
the function (1 — t) ~ around t = 0, we obtain GV^exp (— x) = e, where, for simplicity, we used G 
in place of G — 1. This equation can be rewritten in the form —2x exp (—22;) = —7"^, where 7 = ^^^. 
The inverse function of such equation is called the Lambert W function |34|. For real arguments, it is 
not injective, therefore it is divided in two branches: x > 1/2 or x < 1/2. Since in our setup x < 1/2, 
the lower branch, denoted W-i, is considered, and our solution satisfies —2x = W-i (— 7"^). By using 
the asymptotic expansion W-i (— 7~^) ~ — 2 In 7 — In (2 In 7) and solving for MN we obtain (1311 ). The 
claim of the theorem follows from Theorem 2.7 in ||28l . Finally, since in this work we consider iC-sparse 
signals, in the error term (l32i . we discarded the term for nearly-sparse signals present in 



F. Proof of Theorem \3\ 

Because the variables Ci:Ar are identically distributed, and so are ^i:j\/, the average E [A^ (t, :) A (t, :)] 
does not depend on the index t = N {m — 1) + n, where the last relation follows from the definition of 
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a^. Therefore, we have E [A^ (t, :) A (t, :)] = j^ Y.tL^ ^ [^" (t, A {t, :)] = ^E [Q]. Thanks to 
Lemma [H we can focus only on the first row of j^]^^ [Q]- Using (fTSl ). the element of the first row of 
such matrix are rj (mi,j) for i = 1, . . . , G. From (|2T| | in Proposition [1] we know that r] (ui j) = t/j^ {ui,i)- 
Thus, requiring (|34] |. i.e., ipz {ui,i) = for z = 2, . . . , G, together with the fact that exp (jzui^i) = 1 
(because ni i = irZ {(pi — (pi) = 0), gives the "if direction of the claim. 

The "only if direction follows by noticing that when (|34] | is not satisfied there will be at least one i 
such that T] (ui^i) 7^ 0. Therefore, the matrix A does not satisfy the isotropy property, showing that (|34] i 
is also a necessary condition. 
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